Background of the Invention
This invention relates to protein interaction detection systems. 
Genetic analysis is a tool for understanding the protein networks that govern 
biological processes. The manipulations performed by geneticists (e.g., staging of 
temperature sensitive mutants, construction and analysis of double mutants, and 
generation and observation of F1 and F2 progeny) define relationships between genes.
These abstract relationships between genes often reflect underlying biological realities. 
For example, the epistasis relation may suggest that one gene normally acts on another to 
cause a phenotype, the allele-specific suppression relation may suggest that two gene
products physically interact (Hartman and Roth, Adv. Genet. 17:1-105 (1973); Jarvik and
Botstein, Proc. Natl. Acad. Sci. USA 70:2046-50 (1973)), and the dependency relation
may suggest that the action of one gene product precedes that of another in time
(Hereford and Hartwell, J. Mol. Biol. 84:445-61 (1974)).

Information obtained from these genetic manipulations is typically of very high 
quality, but is often relatively difficult to acquire. The recent increase in the rate of 
identification of new coding sequences has renewed interest in global systematic methods 
to understand gene function. These methods include the "two hybrid" or "interaction 
trap" methods which have been developed to assay contact between a single bait and 
interacting proteins (Fields and Song, Nature 340:245-6 (1989); Chien et al., Proc. Natl.
Acad. Sci. USA 88:9578-82 (1991); Gyuris et al., Cell 75:791-803 (1993); Durfee et al.,
Genes Dev. 7:555-69 (1993); Estojak et al., Mol. Cell. Biol. 15:5820-9 (1995)). Contact
between two proteins in these systems defines a physical relationship that is frequently of
biological significance.

Summary of the Invention 
In general, the invention features a method for detecting a protein-protein 
interaction, involving: (a) providing a host cell which contains (i) a first reporter gene 
operably linked to a DNA sequence that includes a first protein binding site; (ii) a second 
reporter gene operably linked to a DNA sequence that includes a second protein binding 
site; (iii) a first fusion gene which expresses a first fusion protein, the first fusion protein
including a first protein covalently bonded to a binding moiety which is capable of 
specifically binding to the first protein binding site; (iv) a second fusion gene which 
expresses a second fusion protein, the second fusion protein including a second protein 
covalently bonded to a binding moiety which is capable of specifically binding to the 
second protein binding site; and (v) a third fusion gene which expresses a third fusion 
protein, the third fusion protein including a third protein covalently bonded to a gene 
activating moiety; (b) measuring expression output of the first reporter gene as a measure 
of the interaction between the first and the third proteins; (c) measuring expression output 
of the second reporter gene as a measure of the interaction between the second and the 
third proteins; and (d) interpreting the expression output results of step (b) and step (c), 
whereby (i) increased output of both of the first and the second reporter genes indicates 
that the third fusion protein interacts with both of the first and the second fusion proteins; 
(ii) increased output of the first reporter gene but not the second reporter gene indicates 
that the third fusion protein interacts with the first fusion protein but not the second fusion 
protein; (iii) increased output of the second reporter gene but not the first reporter gene 
indicates that the third fusion protein interacts with the second fusion protein but not the 
first fusion protein; and (iv) no change in output in either of the first or the second
reporter genes indicates that the third fusion protein does not interact with either of the
first or the second fusion proteins.
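
The interpretation in step (d) amounts to a four-way decision on two on/off readings. The following Python sketch is offered only as an illustration of that decision; the function name `interpret_two_bait` and the Boolean inputs are assumptions, and how an "increased output" is called from a real assay is left outside the sketch.

```python
def interpret_two_bait(first_increased, second_increased):
    """Interpret the outputs of the two reporters, following step (d).

    Each argument is True when the corresponding reporter shows increased
    output relative to the state without interaction (an assumed input).
    """
    if first_increased and second_increased:
        return "third protein interacts with both the first and the second protein"
    if first_increased:
        return "third protein interacts with the first protein only"
    if second_increased:
        return "third protein interacts with the second protein only"
    return "third protein interacts with neither protein"


# Walk through the four cases (i)-(iv) of step (d).
for first, second in [(True, True), (True, False), (False, True), (False, False)]:
    print(first, second, "->", interpret_two_bait(first, second))
```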

In a preferred embodiment, the method further involves comparing the expression 
output results of step (b) and step (c) with the expression output result measured in either 
(a) a first comparison host cell which contains (i) a reporter gene operably linked to a 
DNA sequence including a protein binding site; (ii) a first fusion gene which expresses a 
first fusion protein, the first fusion protein including the first protein covalently bonded to 
a binding moiety which is capable of specifically binding to the protein binding site; and 
(iii) a second fusion gene which expresses a second fusion protein, the second fusion 
protein including the third protein covalently bonded to a gene activating moiety; or (b) a 
second comparison host cell which contains (i) a reporter gene operably linked to a DNA 
sequence including a protein binding site; (ii) a first fusion gene which expresses a first 
fusion protein, the first fusion protein including the second protein covalently bonded to a 
binding moiety which is capable of specifically binding to the protein binding site; and 
(iii) a second fusion gene which expresses a second fusion protein, the second fusion 
protein including the third protein covalently bonded to a gene activating moiety; or (c) 
both of the first and the second comparison host cells. In addition, any number of 
comparisons may be made between multiple "two-bait" cells or between a "two-bait" cell 
and multiple "one-bait" cells, or both. 

In other preferred embodiments, at least one of the first or the second reporter 
genes may be reduced in expression level; one of the first or the second protein binding 
sites is a tetracycline operator; one of the first or the second reporter genes is URA3 or 
lacZ; one of the first or the second reporter genes produces a signal that is received and 
detected by a second cell; the host cell is a yeast cell or a mammalian cell; the first and 
the second reporter genes may be expressed simultaneously; and the first and the
second proteins are allelic variants.

In a second aspect, the invention features a method for detecting a protein that 
mediates a change in the state of another protein, involving: (a) providing a host cell
which contains (i) a reporter gene operably linked to a DNA sequence including a protein 
binding site; (ii) a first fusion gene which expresses a first fusion protein, the first fusion 
protein including a first protein covalently bonded to a binding moiety which is capable 
of specifically binding to the protein binding site; and (iii) a second fusion gene which 
expresses a second fusion protein, the second fusion protein including a second protein 
which is capable of interacting with the first protein and which is covalently bonded to a 
gene activating moiety, wherein at least one of the first or the second proteins may exhibit 
a change in state; (b) allowing the first and the second proteins to interact; (c) measuring 
expression of the reporter gene as a measure of the interaction between the first and the 
second proteins; (d) introducing into the cell a third gene expressing a third protein; and (e)
measuring expression of the reporter gene, a change in the reporter gene expression in the 
presence of the third protein being an indication that the third protein mediates a change 
in the state of the first or the second protein leading to an alteration in the ability of the 
first protein and the second protein to interact. 

In a related aspect, the invention features an alternative method for detecting a 
protein that mediates a change in the state of another protein, involving: (a) providing a 
first host cell which contains (i) a reporter gene operably linked to a DNA sequence 
including a protein binding site; (ii) a first fusion gene which expresses a first fusion 
protein, the first fusion protein including a first protein covalently bonded to a binding 
moiety which is capable of specifically binding to the first protein binding site; and (iii) a 
second fusion gene which expresses a second fusion protein, the second fusion protein 
including a second protein which is capable of interacting with the first protein and which 
is covalently bonded to a gene activating moiety, wherein at least one of the first or the 
second proteins may exhibit changes in state; (b) allowing the first and the second 
proteins to interact; (c) measuring expression of the reporter gene in the first host cell as a 
measure of the interaction between the first and the second proteins; (d) providing a
second host cell which contains (i) the reporter gene operably linked to the DNA 
sequence including the protein binding site; (ii) the first fusion gene which expresses the 
first fusion protein; (iii) the second fusion gene which expresses the second fusion
protein; and (iv) a third gene which expresses a third protein; (e) allowing the first and the
second proteins to interact in the presence of the third protein; and (f) measuring expression of
the reporter gene in the second host cell as a measure of the interaction between the first
and the second proteins in the presence of the third protein, a change in the reporter gene
expression in the second host cell as compared to that measured in the first host cell being 
an indication that the third protein mediates a change in the state of the first or the second 
protein resulting in an alteration in the ability of the first protein and the second protein to 
interact. 
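
The comparison at the heart of both methods is a before/after (or cell 1 versus cell 2) comparison of reporter expression. The Python sketch below illustrates one way such a comparison could be scored. The function name `third_protein_effect` and the two-fold threshold are hypothetical choices made for illustration; the text does not specify how large a change must be to count.

```python
def third_protein_effect(without_third, with_third, fold_threshold=2.0):
    """Classify the change in reporter expression caused by a third protein.

    without_third, with_third: reporter expression (any consistent positive
    unit) measured in the absence and presence of the third protein.
    fold_threshold: hypothetical cut-off; not taken from the text.
    """
    if without_third <= 0 or with_third <= 0:
        raise ValueError("expression values must be positive")
    ratio = with_third / without_third
    if ratio >= fold_threshold:
        return "increased"
    if ratio <= 1.0 / fold_threshold:
        return "decreased"
    return "no change"


# Illustrative numbers only, not measured data.
print(third_protein_effect(10.0, 40.0))
print(third_protein_effect(40.0, 10.0))
print(third_protein_effect(10.0, 12.0))
```

Under this reading, either "increased" or "decreased" would suggest that the third protein alters the state of the first or the second protein.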

In preferred embodiments of each of the above methods, the change in state is a 
conformational change; the protein exhibiting a conformational change is a Ras protein; 
the host cell is a yeast cell or a mammalian cell; the expression of the first fusion protein 
and the third protein occurs in response to an extracellular stimulus; and the reporter gene 
produces a signal that is received and detected by a second cell.

In another related aspect, the invention features a cell that includes (i) a first 
reporter gene operably linked to a DNA sequence including a first protein binding site; 
(ii) a second reporter gene operably linked to a DNA sequence including a second protein 
binding site; (iii) a first fusion gene which expresses a first fusion protein, the first fusion 
protein including a first protein covalently bonded to a binding moiety which is capable 
of specifically binding to the first protein binding site; (iv) a second fusion gene which 
expresses a second fusion protein, the second fusion protein including a second protein 
covalently bonded to a binding moiety which is capable of specifically binding to the 
second protein binding site; and (v) a third fusion gene which expresses a third fusion 
protein, the third fusion protein including a third protein covalently bonded to a gene
activating moiety. 

In preferred embodiments, at least one of the first or the second reporter genes may 
be reduced in expression level; one of the first or the second protein binding sites is a 
tetracycline operator; one of the first or the second reporter genes is URA3 or lacZ; one of 
the first or the second reporter genes produces a signal that is received and detected by a 
second cell; and the host cell is a yeast cell or a mammalian cell. 

In yet another related aspect, the invention features a reporter gene that includes a
tetracycline operator operably linked to a gene encoding a detectable product (for 
example, a URA3 gene or a lacZ gene). 

In a final aspect, the invention features a method for detecting whether a candidate 
protein interacts with a transcriptional activator, involving: (a) providing a host cell which 
contains (i) a reporter gene which can be reduced in expression level, the reporter gene 
being operably linked to a DNA sequence including a protein binding site; (ii) a first
fusion gene which expresses a first fusion protein, the first fusion protein including the
transcriptional activator covalently bonded to a binding moiety which is capable of
specifically binding to the protein binding site; and (iii) a second fusion gene which
expresses a second fusion protein, the second fusion protein including the candidate
protein covalently bonded to a gene activating moiety; and (b) detecting an increase in
expression of the reporter gene as an indication of an interaction between the candidate 
protein and the transcriptional activator. 

In preferred embodiments, the reporter gene is a URA3 gene; the expression level 
is reduced by 6-azauracil; the protein binding site is a tetracycline operator; the 
expression level is reduced by tetracycline or a tetracycline derivative; and the host cell is 
a yeast cell or a mammalian cell. 

By a "reporter gene" is meant a nucleic acid sequence whose expression may be 
assayed; such genes include, without limitation, lacZ, amino acid biosynthetic genes, and
the mammalian chloramphenicol transacetylase (CAT) gene. Reporter genes may be 
assayed, for example, by a change in cell viability or color reaction. 

By "expression output" is meant any measurable change in reporter gene 
expression. 

By a "protein binding site" is meant a nucleic acid sequence that may be 
recognized and bound by a protein or peptide. 

By a "transcriptional activator" is meant an amino acid sequence that is capable of
increasing expression of a gene to which it is bound. A "gene activating moiety" is all or 
a portion of such a transcriptional activator that is capable of increasing gene expression. 

By a "tetracycline derivative" is meant a compound that is related to tetracycline 
and that is capable of inhibiting the binding of a tetracycline repressor to a tetracycline 
operator. An exemplary tetracycline derivative is anhydrotetracycline. 

By "a change in state" is meant any physical or chemical change in a protein that 
alters its activity. Examples of changes in state include changes in phosphorylation 
patterns or conformation. 

Other features and advantages of the invention will be apparent from the following 
detailed description, and from the claims. 

Detailed Description 
The drawings will first be described. 

Description of the Drawings 
FIGURES 1A-1C are a series of three panels which schematically illustrate an
exemplary two-bait interaction trap of the invention. In Cell 1 (Fig. 1A), on glucose
medium, a transcriptionally-inert tetracycline repressor (TetR) fusion (Bait1, Ba1) and a
LexA fusion (Bait2, Ba2) are bound to respective Tet and LexA operators upstream of
reporters. The potentially interacting protein, fused to the transcription activation domain
B42, is not expressed (Gyuris et al., Cell 75:791-803 (1993)). In this system, neither
reporter gene is activated, the cell grows on 5-fluoroorotic acid (5-FOA) medium and
does not grow on uracil⁻ medium, and is white on
5-bromo-4-chloro-3-indolyl-β-D-galactoside (Xgal) medium. In Cell 2 (Fig. 1B), on
galactose and raffinose medium, Prey 1, a protein fused to B42, is expressed, but does not
interact with either bait. The reporters are not activated, the cell grows on 5-FOA and
does not grow on uracil⁻ medium, and is white on Xgal. In Cell 3 (Fig. 1C), on galactose
and raffinose medium, another prey, Prey 2, is expressed and interacts with Ba1 but not
Ba2. In this cell, the URA3 reporter is selectively expressed; the cell grows on uracil⁻
medium but is white on Xgal.

FIGURE 2 is a table of results showing cells that register the logical "And" and 
discrimination relationships ("Ls1" and "Ls2") on interaction relationships A1 and A2.
Cells contain the indicated bait and prey proteins; preys are only expressed on galactose
and raffinose medium. Reporter output is indicated by growth or blue color on the
indicated medium. For the And relationship, the prey protein contacts both baits and
activates both reporters (TetOp-URA3 and LexOp-LacZ). For the Ls1 relationship, the
prey interacts only with the LexA fusion bait, and activates only the LexOp-lacZ reporter. 
Similarly, in the Ls2 relationship, the prey contacts only the TetR fusion bait, and 
activates only the TetOp-URA3 reporter. 

FIGURES 3A-3C are a series of results showing cells that register the logical And
operation on input proteins. Figure 3A is a table showing expression of the LexOp-LacZ
reporter in CWXY2 cells 1 and 2; expression is indicated by the intensity of the blue
color on LHWK/Gal+Raff+Xgal plates (see below) and by their β-galactosidase activity
(indicated in Miller units). Figure 3B is a schematic illustration of two cells with either
TetR-Cdc25 (907-1589) or TetR-GAP, on Gal+Raff+Xgal medium (see below). In Cell 1
on the Gal+Raff plate, the TetR-Cdc25 loads LexA-RasA186 with GTP, allowing binding
of more B42-c-Raf1 and increased transcription of the LexAOp-lacZ reporter. In Cell 2,
TetR-GAP stimulates Ras GTPase activity so that more LexA-Ras is in the GDP-form,
resulting in less B42-c-Raf1 binding, and decreased transcription of the LexAOp-LacZ
reporter. Figure 3C is a truth table depicting the results of operations on protein inputs.
In this figure, "0" or "1" in the second column denotes respectively the presence of
TetR-GAP or TetR-Cdc25, and "0" or "1" in the third column denotes respectively a low
or a high output of β-galactosidase.

FIGURE 4 is a schematic illustration of a cell that produces an output that may be 
read by the producer cell or a second, recipient cell. 

FIGURE 5 is a schematic illustration of a cell having four input-output channels. 

FIGURE 6 is a schematic illustration of a wholly biological circuit that acts as a 
transistor. 

As described herein, the present invention provides methods and reagents for 
detecting or registering complex relationships between two or more proteins. Many of 
these relationships, for example, "bridging" (or connecting) and "discriminating" 
interactions, are useful for understanding gene function. As demonstrated below, by 
performing logical operations on the phenotypic outputs of both new and existing two- 
hybrid cell interactions, detailed models of protein function in pathways and complexes 
can be determined. The protein relationships detected by these cells are analogous to 
classical genetic relationships such as epistasis, but they can be interpreted within a 
rigorous analytical framework, and they can be performed systematically on the products 
of entire genomes. In addition, cells that register such relationships can perform logical 
operations on protein inputs, and thus may define paths for the construction of biological 
computational devices. 

Preliminary Considerations: Logical Protein Relationships in Two-Hybrid Systems

In classical ("one bait") two-hybrid systems, the output of a reporter gene depends 
on the interaction between the DNA-bound bait and activation-tagged prey. Genetic 
markers expressed by some reporters, for example, URA3, allow selection against 
reporter transcription and thus for lack of interaction (Le Douarin et al., Nucl. Acids Res.
23:876-8 (1995)). The relationship between proteins in these systems can be described in 
symbolic-logical terms. 

By this view, contact between a bait (Ba1) and a prey (P1) defines a variable (A1),
here called a contact relationship. Because A1 is operationally defined by the reporter
output, the term A1 is used to refer to the reporter output as well. There are four possible
Boolean operations, or functions, on this relationship (Schneeweiss, Boolean Functions:
With Engineering Applications and Computer Programs (Springer-Verlag, Berlin, New
York, 1989)). Two of these are constant functions: F1(A1) = 0 and F2(A1) = 1, and two
are true functions: F3(A1) = A1 and F4(A1) = Not A1.
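
The count of four follows from listing every possible output pair for the two values of the variable. The short Python sketch below enumerates them; the helper name `unary_boolean_functions` and the tuple representation of a truth table are illustrative assumptions.

```python
from itertools import product


def unary_boolean_functions():
    """Return every function of one Boolean variable as a truth table.

    Each table is a tuple (f(0), f(1)).
    """
    return list(product((0, 1), repeat=2))


# Labels follow the text: F1 and F2 are constant, F3 is the identity, F4 is Not.
NAMES = {
    (0, 0): "F1 (constant 0)",
    (1, 1): "F2 (constant 1)",
    (0, 1): "F3 (A1)",
    (1, 0): "F4 (Not A1)",
}

for table in unary_boolean_functions():
    print(NAMES[table], "-> f(0) =", table[0], ", f(1) =", table[1])
```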

In one-bait systems, the phenotypic consequences of the reporter output can 
register two of these operations on the contact relationship, F3 and F4. This is shown in 
Table 1. In this Table, the contact relationship between the bait Ba1 and the prey P1 is
denoted as A1 and is measured by the output of the reporter. Thus, the values of A1 (0
and 1) refer to both the values of the contact relationship and to the off and on states of
the reporter. "Value of A1" shows the two possible values (0 and 1 or off and on) of A1.
"Operation" shows the two allowed operations (functions) on A1. The two subcolumns of
"Results of operation" refer to the output of the operations on A1. "Alternative names"
gives common names for the operations. "Interpretation" describes the state of 
interaction between the bait and prey proteins. "Defining phenotype" gives the 
phenotype for a contact relationship in the one bait system, and "Example biological 
correlates" gives examples of the biological circumstances that can result in such outputs. 

In Table 1, both F3 and F4 have important biological correlates. Consider a one
bait system, in which contact between Ba1 and P1 (for example, because these proteins
heterodimerize) gives a positive output (blue color on X-Gal, growth on Ura⁻ medium),
while loss of that contact (for example, by mutation or disruption by a peptide aptamer)
(Colas et al., Nature 380:548-550 (1996)) gives a negative output (white color on X-Gal,
growth on 5-FOA medium) (Table 1). A cell growing on 5-FOA medium is analogous to
a device that performs the logical Not operation on the contact relationship or,
alternatively, can be viewed as a cell that registers the state (Not A1).
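
The mapping between the value of the contact relationship and the phenotypes just described can be written out as a small lookup. This Python sketch is illustrative only; the function name `one_bait_phenotype` and the dictionary keys are assumptions, while the phenotypes themselves are those stated in the paragraph above.

```python
def one_bait_phenotype(a1):
    """Map the contact relationship A1 (0 or 1) to the phenotypes in the text."""
    if a1 not in (0, 1):
        raise ValueError("A1 must be 0 or 1")
    if a1 == 1:
        # Contact between bait and prey: both reporters read as "on".
        return {"X-Gal": "blue", "grows_on": "Ura- medium", "registers": "A1"}
    # Loss of contact: growth on 5-FOA selects for the Not A1 state.
    return {"X-Gal": "white", "grows_on": "5-FOA medium", "registers": "Not A1"}


print(one_bait_phenotype(1))
print(one_bait_phenotype(0))
```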

Cells that Detect More Complex Protein Relationships 

Cells were constructed that allowed simultaneous selection for and against two 
distinct protein-protein interactions (Figures 1A-1C). This two-bait interaction trap 
utilized the DNA binding moieties of LexA and TetR, the tetracycline repressor of 
bacterial transposon Tn10 (Gossen and Bujard, Proc. Natl. Acad. Sci. USA 89:5547-51
(1992)). The LexA and TetR fusion bait proteins were expressed in a yeast cell that also 
contained an integrated TetOp-URA3 reporter and an episomal LexAOp-lacZ reporter. 
Expression of baits in these "two-bait" cells was under the control of the ADH1 promoter. 
The TetOp-URA3 reporter was integrated into the LYS2 gene. The LexAOp-lacZ 
reporter, a derivative of pSH18-34, was carried on pCXW24, a 2μm plasmid with a LYS2
marker. The prey vector was pJG4-5, whose GAL1 promoter conditionally directed
expression of an activation-tagged protein from a 2μm TRP1 plasmid (Gyuris et al., Cell
75:791-803 (1993)). 

Logical Operations on a Contact Relationship in a Two-Bait Interaction Trap 

In the two-bait cells, the two contact relationships (and the output of the
corresponding reporters) were expressed as Boolean variables, A1 and A2. There were 16
possible operations on these variables (Schneeweiss, Boolean Functions: With
Engineering Applications and Computer Programs (Springer-Verlag, Berlin, New York,
1989)), four of which were registered in these cells. These operations were referred to as
And, Nor, and the two discrimination operations, logic state 1 (Ls1) and logic state 2
(Ls2). And, Ls1, and Ls2 were considered to represent particularly useful operations for
determining protein function. A summary of these operations is shown in Table 2. In
that Table, the contact relationship between Ba1 and P1 is denoted as A1, and is measured
by the output of the TetOp-URA3 reporter. The contact relationship between Ba2 and P1
is denoted as A2, and is measured by the output of the LexAop-lacZ reporter. Again,
values A1 and A2 (0 and 1) are used to refer to both the values of the contact relationship
and to the values (off and on states) of the reporter. The four subcolumns of "Value of
variables" denote the four different possible combinations of values for the two variables. 
"Operation" shows the four operations on these variables possible in this system. 
"Alternative names" gives common names for the operations. "Interpretation" describes 
the state of interactions between the bait and prey proteins. "Defining phenotype" gives 
the phenotype for touching relationship in the one bait system, and "Example biological 
correlates" gives examples of the biological circumstances that can result in such outputs. 
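
The figure of 16 operations, and the place of the four registered operations among them, can be checked by enumeration. The Python sketch below is illustrative; the names `INPUTS`, `REGISTERED`, and `truth_table` are assumptions. The assignment of the two discrimination operations (only the LexAop-lacZ reporter on for the first, only the TetOp-URA3 reporter on for the second) follows the description of Figure 2 and should be read as an assumption about Table 2, which is not reproduced here.

```python
from itertools import product

# All combinations of (A1, A2): (0,0), (0,1), (1,0), (1,1).
INPUTS = list(product((0, 1), repeat=2))


def all_binary_functions():
    """Every Boolean function of two variables, as a tuple of outputs over INPUTS."""
    return list(product((0, 1), repeat=len(INPUTS)))


# A1 = TetOp-URA3 output, A2 = LexAop-lacZ output.
REGISTERED = {
    "And": lambda a1, a2: a1 and a2,
    "Nor": lambda a1, a2: (not a1) and (not a2),
    "Ls1": lambda a1, a2: (not a1) and a2,   # prey contacts only the LexA bait
    "Ls2": lambda a1, a2: a1 and (not a2),   # prey contacts only the TetR bait
}


def truth_table(fn):
    """Evaluate fn on every input pair and return its outputs as 0/1."""
    return tuple(int(bool(fn(a1, a2))) for a1, a2 in INPUTS)


print("number of two-variable functions:", len(all_binary_functions()))
for name, fn in REGISTERED.items():
    print(name, dict(zip(INPUTS, truth_table(fn))))
```

Each of the four registered operations is true for exactly one combination of reporter states, which is why a single cell phenotype identifies it unambiguously.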

To test the general utility of the system, a set of illustrative proteins was used,
called RasA186, RasA15A186, RasV12A186, GAP, hSos1 (residues 601-1019), Cdc25
(residues 907-1589), c-Raf1 (residues 1-313), Max, and Mxi1. The Harvey ras gene
product, Ras, is a GTPase that exists in two distinct conformations, a GDP-bound
(inactive) form and a GTP-bound (active) form. The fact that Ras cycles between these
conformations allows it to be a switch protein in signal transduction pathways that control
cell proliferation (Boguski and McCormick, Nature 366:643-654 (1993)). All Ras
proteins described herein contained a Cys to Ala change at position 186, which inactivated a
farnesylation site and increased the apparent nuclear concentration of the protein.
RasA186 exists in a mixture of GDP and GTP bound forms, while RasA15A186 and
RasV12A186 are predominantly in the GDP and GTP bound forms, respectively. GAP,
which stimulates GTPase activity of Ras, binds to GTP-bound Ras. c-Raf1, a
downstream target of Ras, also binds GTP-bound Ras. By contrast, hSos1 and Cdc25,
both of which activate Ras, only bind to GDP-bound Ras. Max and Mxi1, which
heterodimerize tightly, were used as positive controls (Zervos et al., Cell 79:388 (1993)).

Figure 2 shows that the two-bait cell registered the logical And operation. The 
relationship is fulfilled by proteins, such as bridging proteins in pathways, that can 
interact with both baits. In this experiment, yeast strain CWXY2 carried B42-RasA186 as
prey, TetR-c-Raf1 and LexA-hSos1 as the baits, and TetOp-URA3 and LexAOp-LacZ as
the reporters. B42-RasA186 interacted with both TetR-c-Raf1 and LexA-hSos1. The cell
was blue on Xgal and grew on medium lacking uracil. This bridging or connecting
relationship was consistent with the idea that Ras interacted with both proteins, and
suggested that proteins that activate both reporters can be selected from genome-wide
screens.

In addition, Figure 2 shows that the two-bait system also registered Ls1 and Ls2,
the discrimination relationships. These relationships are expected for proteins that
interact with one bait but not another. Here, in cells expressing TetR-RasV12A186 and
LexA-Max baits and a B42-c-Raf1 prey, the Ras-Raf interaction activated the TetOp-URA3
reporter, and the cells grew in medium lacking uracil; Raf did not interact with
LexA-Max and thus did not activate the lacZ reporter. On the other hand, in a cell
expressing TetR-RasV12A186 and LexA-Max baits and a B42-Mxi1 prey, the Max-Mxi1
interaction activated the LexAop-LacZ reporter, and the cells turned blue on Xgal; Mxi1
did not interact with Ras and thus did not activate the URA3 reporter.

Moreover, the results in Figure 2 indicated that these cells could discriminate 
between two closely related allelic variants. In a cell expressing TetR-RasA15A186,
LexA-RasV12A186, and B42-c-Raf1, the Raf/RasV12A186 interaction activated expression of the
LexAop-LacZ reporter; the cells turned blue on Xgal but did not grow on medium lacking
uracil. By contrast, in a cell expressing TetR-RasA15A186, LexA-RasV12A186, and
B42-Cdc25, the Cdc25/RasA15A186 interaction activated the TetOp-URA3 reporter, and
allowed the cells to grow on medium lacking uracil but caused them to remain white on Xgal.
This result suggested that these cells could identify, from genomic or combinatorial 
libraries, proteins and peptides that interact with allelic protein variants specific for 
disease states. 



Identification of Peptide Aptamers That Discriminate Between Allelic Variants 

A two-bait cell that contained TetR-RasV12 and LexA-RasA15 was used to isolate
members of a peptide aptamer library that interacted specifically with RasV12. Ura⁺
library transformants were screened for lacZ⁻ cells, which presumably contained aptamers
that did not interact with LexA-RasA15. Plasmids encoding aptamers were then rescued
from these lacZ⁺ cells and their phenotypes reconfirmed. Using this system, two
discriminatory aptamers, Pep22 and Pep104, were identified. Pep22 interacted with both
RasV12 and RasA15, whereas, by contrast, Pep104 interacted only with RasV12. In
particular, the Pep22-containing cell grew on Ura⁻ medium and was blue on X-gal
medium. The Pep104-containing cell grew on Ura⁻ medium but was white on X-gal
medium. These results demonstrated the utility of this system in selection of specific
peptide aptamers. For Pep22, the second bait increased the selectivity of the system by
eliminating potential false positives that might arise from artifactual activation of a single
reporter. For Pep104, the second bait allowed detection of aptamers specific for an allelic
form of the protein active in signal transduction. The sequences of Pep22 and Pep104 are
DMDWFFRFYA SVSRLFRHLH (SEQ ID NO: 15) and FWQATLRLVS DKLILLYPDP
(SEQ ID NO: 16), respectively.

Logical Operations on Protein Inputs

Cells can also perform logical operations on protein inputs, and register the result 
of those operations by changes in output. Figures 3A-3C show a logical And operation 
on protein inputs. In a cell expressing LexA-RasA186, B42-c-Raf1, and TetR-GAP, the
output of the LexAop-lacZ reporter was low (light blue on Xgal medium), presumably
because GAP drove most of the Ras to the GDP-bound conformational state. By contrast,
input of TetR-Cdc25 increased the RasA186/c-Raf1 interaction, as shown by the intensity
of blue color on Xgal plates, presumably by changing Ras into the Raf binding
conformation. In this experiment, the cell had two inputs, one of which, B42-c-Raf1
(logical value 1) was constantly present, while the other was either TetR-GAP (logical 
value 0) or TetR-Cdc25 (logical value 1); and the output was either high (1) or low (0). 
In this case, the output of the LexAop-lacZ reporter was thus controlled by a LexA-Ras 
switch protein whose conformation depended on the values of the inputs. 
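
The And reading of this experiment can be stated as a two-input truth table. The Python sketch below is illustrative; the names `MODIFIER_VALUE` and `reporter_output` are assumptions. Only the rows with B42-c-Raf1 present correspond to the cells described above; the rows with B42-c-Raf1 absent are an extrapolation of the And reading, not a reported result.

```python
# Logical values assigned to the second input, as in the text.
MODIFIER_VALUE = {"TetR-GAP": 0, "TetR-Cdc25": 1}


def reporter_output(raf_present, modifier):
    """Return 1 (high) or 0 (low) LexAop-lacZ output under an And reading.

    raf_present: 1 if B42-c-Raf1 is expressed, else 0.
    modifier: "TetR-GAP" (logical 0) or "TetR-Cdc25" (logical 1).
    """
    if modifier not in MODIFIER_VALUE:
        raise ValueError(f"unknown modifier: {modifier}")
    return int(bool(raf_present)) & MODIFIER_VALUE[modifier]


for raf in (1, 0):
    for modifier in MODIFIER_VALUE:
        print(f"B42-c-Raf1={raf}, {modifier} -> output {reporter_output(raf, modifier)}")
```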

In an alternative to the above system, Cdc25 may be replaced with hSos1, and
similar results obtained. In addition, expression of system components may be further
regulated. For example, in the systems described above, B42-c-Raf1 may be expressed
using a GAL1 promoter, and/or Cdc25 and hSos1 may be expressed from a promoter
normally repressed by TetR-Sin3. This results in expression of B42-c-Raf1 being
dependent on the presence of galactose in the growth medium, and expression of Cdc25
and hSos1 being dependent on the presence of tetracycline or a tetracycline derivative in
the growth medium. In these systems, the cell is responding to two distinct extracellular 
inputs, one that controls the expression of the modifying protein and the other that 
controls expression of the prey. 

Protein Relationships in Two Hybrid Systems 

Based on the experiments described above, the outputs of reporter genes in one 
bait two-hybrid systems may be viewed as reflecting two basic logical states. The
activation of the reporter gene embodies the contact relationship A1 between bait 1
(Ba1) and prey (P1). Not A1 (between Ba1 and P1) defines the lack of interaction, caused,
for example, by a mutation in the interacting partners, or disruption of the interaction by a 
third protein or peptide aptamer (Colas et al., Nature 380:548-550 (1996)). 



Construction of Cells with Independently Functional Interaction Reporters 

As described above, cells were constructed that detected more complex protein 
relationships by making a version of the interaction trap that utilized the DNA binding 
moieties of LexA and TetR, the tetracycline repressor of bacterial transposon Tn10.
Fusion proteins containing these moieties were expressed as two baits in a cell that also 
contained TetOp-URA3 and LexAop-lacZ reporters. This system allowed simultaneous 
determination of two genetic interactions in a single cell. 

The inclusion of TetR baits and TetOp-URA3 reporters significantly facilitated 
interaction trap applications. The phenotype dependent on the TetOp-URA3 reporter was 
more sensitive than that of lacZ and LEU2 reporters (see, for example, Gyuris et al., Cell
75:791-803 (1993) and Estojak et al., Mol. Cell. Biol. 15:5820-9 (1995)), which facilitates
detection of weak interactions. In addition, both the URA3 and lacZ reporter genes may
be quantitatively assayed (Shostak et al., Anal. Biochem. 191:365-9 (1990)). Moreover,
the sensitivity of this URA3 reporter can be down-regulated in two ways. Expression of
the URA3 reporter can be titrated by 6-azauracil, an inhibitor of the URA3 gene product,
orotidine-5'-monophosphate decarboxylase (OMPdecase) (Le Douarin et al., Nucl. Acids
Res. 23:876-8 (1995)). Reporter activity can also be reduced by tetracycline or its
derivatives (for example, anhydrotetracycline), which disrupt binding of the TetR bait to
Tet operators. Preferably, such compounds are used at a concentration of up to 100 µg/ml
on plates. Both kinds of inhibitors diminish the sensitivity of the URA3 reporter,
allowing its use with baits that activate transcription and allowing its use, along with lacZ,
for crude estimation of interaction affinities. Moreover, the URA3 reporter allows the use
of 5-FOA to select against gene expression (Boeke et al., Mol. Gen. Genet. 197:345-6
(1984)). In this case, tetracycline and 6-azauracil can regulate the threshold amount of
transcription selected against, facilitating the selection of peptide aptamers that break 
specific protein interactions. 

Logical Analysis of Binary and Higher-Order Protein Relationships 

As shown in Table 2, the two-bait cells registered four logical relationships: Nor,
And, Ls1, and Ls2. Three are particularly important. The And relationship (A1 (between
Ba1 and P1) and A2 (between Ba2 and P1)) was found for prey proteins (connecting
proteins) that contacted both baits (Table 2). Identification of such proteins is useful for
continued construction of dense charts of genetic networks and for connecting pathways
not previously known to be related. Such sets of interacting proteins are sometimes
referred to as "protein contigs."
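
As a non-limiting sketch, the four relationships named above can be read off the two reporter outputs of a two-bait cell. The Python below is illustrative only. It assumes that Ba1 is the TetR bait read by the TetOp-URA3 reporter and Ba2 is the LexA bait read by the LexAop-lacZ reporter, that Nor denotes the absence of both contacts, and that Ls2 is the mirror image of Ls1 (A1 and Not A2); Table 2 itself is not reproduced here, and the function name `classify` is hypothetical.

```python
def classify(a1: bool, a2: bool) -> str:
    """Map two contact relationships to one of the four named relations.

    a1: contact between Ba1 and the prey (assumed read via TetOp-URA3).
    a2: contact between Ba2 and the prey (assumed read via LexAop-lacZ).
    """
    if a1 and a2:
        return "And"   # prey contacts both baits
    if not a1 and not a2:
        return "Nor"   # prey contacts neither bait (assumed meaning)
    if not a1 and a2:
        return "Ls1"   # Not A1 and A2
    return "Ls2"       # A1 and Not A2 (assumed by symmetry with Ls1)


for a1 in (False, True):
    for a2 in (False, True):
        print(f"A1={int(a1)} A2={int(a2)} -> {classify(a1, a2)}")
```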

Ls1 and Ls2, the discrimination relationships, were also important. These
relationships are relatively complex: for example, the Ls1 relationship involves two
operations on two interactions: Not A1 (between Ba1 and P1) and A2 (between Ba2 and P1).
These operations have numerous biological correlates, in that a cell that registers this
relation allows detection of proteins that interact differently with unrelated proteins,
allelic variants, and different conformational states (Figure 2), and also allows detection
of proteins that interact differentially with different modification states. The use of these
relationships to survey the products of combinatorial libraries and genomes allows
selection of proteins that interact specifically with proteins encoded by disease-state
alleles, or with proteins that differ from wild-type due to differential splicing or
posttranslational modification.

Analysis of Higher-Order Protein Relationships

The protein relationships that can be inferred from two-bait cells are not always 
identical to those revealed by one-bait cells. For example, if both Ba1 and Ba2
oligomerize to form a surface that interacts with P, then neither the Ba1/P nor the Ba2/P
interaction will be detected in one-bait cells. Such differences in contact relationships are
useful, since combining data from one- and two-bait cells allows the experimenter to
make inferences about the topology, temporal sequence, and posttranslational
modification dependence of the protein interactions.

Table 3 shows inferences about physical interactions among three proteins, X, Y, 
and Z: (i) from possible outputs of a two-bait cell that detects contact relationships A1
(between X and Z), A2 (between Y and Z), and A3 (between X and Y). In this Table,
"reporter output" shows the state of the contact relationships registered by outputs of the
reporters in three one-bait cells and a single two-bait cell. X and Z are fused to an
activation domain to form preys [P(X) and P(Z)], and Y and X are fused to a DNA
binding domain to form baits [Ba(X) and Ba(Y)]. In a two-bait cell, the outputs of the
reporters show the state of the contact relationship for proteins X, Y, and Z where they
are fused with one of two DNA binding domains [Ba1(X) or Ba2(Y)] and an activation
domain [P(Z)]. "Physical interpretation" shows one possible biological interpretation of
this set of reporter outputs for combinations of one- and two-bait data, or for one-bait data
alone. Although not all patterns in Table 3 necessarily have biological correlates, many
have been observed, for example, cases in which the interaction of Bait1 and prey depends
on the presence of Bait2. Experiments such as those depicted in Table 3 indicating linkages
of one- and two-bait data are useful in ordering the function of proteins in pathways and
complexes.
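
Purely as an illustrative sketch, the linkage of one-bait and two-bait data described above can be expressed as a comparison of the same contact scored in the two kinds of cell. The Python below does not reproduce Table 3. The function name `compare` and the wording of the returned interpretations are assumptions, and the case in which a contact is seen in the one-bait cell but lost in the two-bait cell is included only as a hypothetical possibility that the text does not discuss.

```python
def compare(one_bait: bool, two_bait: bool) -> str:
    """Compare one contact (e.g. Bait1/prey) scored in one- and two-bait cells.

    one_bait: reporter output when only Bait1 and the prey are present.
    two_bait: output of the Bait1 reporter when Bait2 is also present.
    """
    if one_bait and two_bait:
        return "contact detected with or without the second bait"
    if not one_bait and two_bait:
        # Case named in the text: Bait1/prey interaction depends on Bait2.
        return "contact depends on the presence of the second bait"
    if one_bait and not two_bait:
        # Hypothetical case, not discussed in the text.
        return "contact lost when the second bait is present"
    return "no contact detected"


print(compare(one_bait=False, two_bait=True))
```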

Application to the Analysis of Gene Function 

This two-bait system, particularly when combined with existing one-bait systems,
thus extends the scope of yeast interaction technology to analyze the function of genes in
pathways. It can aid the identification of proteins and peptide aptamers that distinguish
between allelic variants of proteins. In addition, linkage of data from two-bait cells
(likely to result from individual experiments) and from one-bait cells (perhaps obtained
from genome-wide surveys) allows detailed analysis of protein function and contact
topology in pathways and complexes. It also allows more precise analysis of the
topology of proteins in multi-protein complexes, and it provides robust, scalable,
semi-automatic ways to distinguish among alternative models of protein interactions. In
addition, because the relationships among proteins defined in this way lend themselves to
systematic analysis, they can be determined industrially.

Towards Protein-and-Transcription Based Logical Circuitry 

The above experiments made use of a protein, Ras, that cycles between two
conformational states, and an activation-tagged protein, Raf, that binds Ras in one of
these states. The state of the Ras switch, and its output measured by transcription, was
shown to vary, depending on whether the input protein was GAP or Cdc25. In these
experiments the cells were acting as And gates, in which one input, B42-c-Raf1, was held
constant (logical value 1), the other was either GAP (logical value 0) or Cdc25 (logical
value 1), and the phenotypes caused by expression of the reporters constituted the
outputs. With B42-c-Raf1 and GAP, the output was low β-galactosidase activity (0).
With B42-c-Raf1 and Cdc25, the output was high β-galactosidase activity (1). Although
it was also scored by reporter expression, this And operation was not an operation on the
contact relationship like those described earlier. Rather, the operation was on the protein
inputs, and in this case the cells were acting as true logic gates.

In the cells described herein, to change the inputs, different DNA constructions
were employed that expressed interacting proteins; to measure output, human or other
observers were needed. Construction of wholly biological transcription-based logic
circuitry requires replacing these inputs and outputs with logical inputs that vary in
response to extracellular stimuli, such as secreted peptide pheromones or light, and
outputs that generate such stimuli.

Exemplary systems employing such biological circuitry are shown in Figures 4-6. 
In particular, in Figure 4, a cell is depicted that produces an output that may be read by 
the producer cell or by a second, recipient cell in response to the input. As shown in 
Figure 4, addition of the input protein, yeast a-factor, results in the expression, through a
G protein pathway, of a TGF-β output, an output that may be received by the same or a
second cell and that may produce a phenotypic change in the recipient cell. Figure 4 also
depicts a system in which a TGF-β input protein binds to its receptor, resulting in the
activation and translocation of a LexA-Madd (also referred to as a LexA-Smad) protein. 
This LexA-Madd protein triggers expression of a yeast a-factor output, a signal that again 
may be read by the producer cell or by a second, recipient cell and may produce a 
phenotypic change in the recipient cell. 

Figure 5 extends this system and illustrates a cell having four different
input-output channels, thereby allowing for a variety of logical operations. In Figure 5, the
input proteins a-factor, TGF-β, delta, and bradykinin interact with the a-factor receptor,
TGF-β receptor, notch receptor, and bradykinin receptor, respectively, to activate
pathways that result in expression of output proteins that may be read by cellular systems,
for example, as transcriptional changes. As the number of input and output options
increases, the number of channels that may be programmed increases; for example, a cell
having 10 input/output channels may be programmed in 2^10, or more than 1000, different
states. In one exemplary approach to increasing input/output channels, a protein, such as
the LexA-Smad protein discussed above, may be engineered to contain different
aptameric moieties, each conferring a different receptor specificity on the chimera.
Moreover, in these systems, the input, rather than being a protein, may be some other
extracellular stimulus, such as light or a particular wavelength of light that triggers a
specific photoreceptor pathway. Preferably, in such a system, the output is a fluorescent
protein, for example, a green, red, or blue fluorescent protein.
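
The state count given above can be checked with a short calculation. The following Python sketch assumes, for illustration only, that each input/output channel is binary (on or off); the function name `channel_states` and the short channel labels are hypothetical.

```python
from itertools import product


def channel_states(n_channels: int) -> int:
    """Number of distinct on/off settings of n binary channels."""
    return 2 ** n_channels


print(channel_states(10))  # ten channels, as in the example in the text

# The four channels of Figure 5, enumerated explicitly (labels are shorthand).
channels = ("a-factor", "TGF-beta", "delta", "bradykinin")
settings = list(product((0, 1), repeat=len(channels)))
print(len(settings), "settings for", len(channels), "channels")
```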

One final type of wholly biological circuit is depicted in Figure 6. This cell 
represents an exemplary "transistor." In this cell, in the presence of green light, a reporter 
gene is triggered to express an HIV protease. This protease then cleaves a target linker 
that joins together a blue fluorescent protein-green fluorescent protein chimera, releasing
both components, and resulting in a large decrease in green light output.

The wholly biological logical devices described above might not be very fast. 
Although one of these switch proteins, Ras, can cycle in milliseconds and a number of 
signal transduction pathways can provide inputs within minutes (Bray, Nature 376:307- 
312 (1995)), reporter output may require minutes to hours to be detectable. However, 
because required construction work near the DNA may be performed using 
straightforward techniques, it is likely that gene expression will remain a useful output. 
Construction of such circuits is facilitated by the rapid increase in the number of natural 
and artificially selected protein domains with useful allosteric and targeting properties 
(Colas et al., Nature 380:548-550 (1996)). It is thus possible that such transcription- 
based technologies provide an early route to biological computation. 

Material and Methods 
The Two-Bait System 

In the experiments described above, the vectors pEG202 and pJG4-5 were utilized
as Ba2 and prey expression plasmids (Gyuris et al., Cell 75:791-803 (1993)). In addition,
the Ba1 expression plasmid, pCWX200, the LexAop-lacZ reporter plasmid, pCWX24, and
TetOp-URA3 reporter vectors were constructed and were integrated into Saccharomyces
cerevisiae strains to create CWXY1 and CWXY2 as follows.

Construction of the TetR-fusion protein expression vector, pCWX200. pCWX200
carried a 2 µm replicator and LEU2 marker. The promoter and cloning region were derived
from pEG202 and the backbone from pBC100, a derivative of YEplac181 (Gietz and
Sugino, Gene 74:527-534 (1988)). To remove the pBC100 polylinker region (from AccI
to EcoRI) as well as DNA encoding the amino-terminal fragment of β-galactosidase, the
vector was digested with NarI/AccI and religated to yield pCWX100. The HindIII site
was then deleted by digesting pCWX100 with HindIII, blunt-ending the overhangs with
Klenow, and religating to yield pCWX100ΔHindIII. The SphI fragment of pEG202,
containing the ADH1 promoter, LexA, a polylinker, and the ADH1 terminator, was then
inserted into the SphI site of pCWX100ΔHindIII to yield pCWX150 (Gyuris et al., Cell
75:791-803 (1993) and Estojak et al., Mol. Cell. Biol. 15:5820-9 (1995)). Finally, the
EcoRI/HindIII LexA fragment of pCWX150 was replaced with an EcoRI/HindIII-ended
PCR fragment that encoded the Tet repressor (TetR) (Zervos et al., Cell 79:388 (1993)).
The resulting plasmid, pCWX200, had cloning sites equivalent to those of pEG202
(Gossen and Bujard, Proc. Natl. Acad. Sci. 89:5547-51 (1992)).

Construction of the LexAop-lacZ reporter, pCWX24. The LexAop-lacZ reporter,
the LYS2 marker, the 2 µm replicator, and the rest of the pCWX24 plasmid backbone
were derived from intermediate plasmids pCWX221, pCWX01, and pCWX21,
respectively.

Construction of pCWX221. To construct this plasmid, SH18-34T was digested to
completion with KpnI, a partial digestion with BamHI was performed, and the vector was
ligated to a BamHI/KpnI-digested backbone of pBluescript. The LexAop-lacZ reporter
was on the NotI/KpnI fragment of pCWX221.

Construction of pCWX01. To construct pCWX01, YEp426 was digested with SalI/ClaI
(Ma et al., Gene 58:201-16 (1987)), and a ~5600 bp ClaI-XbaI fragment was
obtained that carried LYS2 (Barnes and Thorner, Mol. Cell. Biol. 6:2828-38 (1986)) and
about 300 bp of the pBR322 tetracycline resistance gene. This fragment was ligated with
the pBSΔKpnI (pBluescriptΔKpnI) backbone digested with ClaI/SalI, to generate
pCWX01.

Construction of pCWX21. pRF24 was digested with NotI and blunt-ended with
Klenow to yield pCWX20ΔNotI. To create pCWX21, pCWX20ΔNotI was digested with
BamHI and ligated to self-annealed 5'-GATCCGCGGCCGCG-3' (SEQ ID NO: 1) to
generate a new NotI site.

Assembly of pCWX24. To assemble pCWX24, a NotI/KpnI LexAop-lacZ fragment
of pCWX221 was inserted into the NotI/KpnI site of pCWX21 to create pCWX22. Next,
a self-annealed 5'-CTAGGGCCCTAGCATG-3' fragment was inserted at the SphI site of
pCWX22 to generate an ApaI site and the vector pCWX23. After digesting pCWX23
with ApaI/PmeI, a SmaI/ApaI-digested LYS2 fragment of pCWX01 was inserted to create
pCWX24.

Construction of TetOp-URA3. The TetOp-URA3 constructs were generated from
the intermediate plasmids pCWX211, pCWX213T2, pCWX213T10, pCWX02T2, and
pCWX02T10.

Construction of pCWX211. Following amplification of the URA3 gene from
pSH200 with 5'-AAAAGGAAAAGCGGCCGCTTAGTTTTGCTGGCCGCATCTTC-3'
(SEQ ID NO: 2) and 5'-CGGAATTCTTTCGAAAGCTACATATAAGGAAC-3' (SEQ
ID NO: 3), the NotI/EcoRI-ended PCR product was ligated to EcoRI/NotI-digested pBS to
generate pCWX211.

Construction of pCWX213T2 and pCWX213T10. To construct these vectors, a
BamHI fragment from pLR1Δ1, containing the yeast Gal1/Gal10 promoter region with a
unique XhoI site (West et al., Mol. Cell. Biol. 4:2467-78 (1984)), was inserted into the
BamHI site of pBS to create pCWX210. Two strands were then synthesized and annealed
to create an XhoI/SalI-ended DNA fragment that contained two TetO2 operators (Meier et
al., EMBO J. 7:567-72 (1988)), as follows (SEQ ID NOS: 4 and 5):



5'-TCGAGcactccctatcagtgatagagaaaacactccctatcagtgatagagaaaaG-3'
3'-    CgtgagggatagtcactatctcttttgtgagggatagtcactatctcttttCAGCT-5'
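
As a hedged consistency check on the two strands shown above, the following Python sketch tests that they base-pair everywhere except for a four-nucleotide 5' overhang at each end. It assumes that the top strand is written 5' to 3' and that the bottom strand is printed 3' to 5' beneath it; the function names and the `REPEAT` constant (the 25-nucleotide lower-case unit that occurs twice in the top strand) are introduced here for illustration only.

```python
# SEQ ID NOS: 4 and 5 as printed. Top strand read 5'->3'; bottom strand
# assumed to be printed 3'->5' (an assumption about the figure layout).
TOP = "TCGAGcactccctatcagtgatagagaaaacactccctatcagtgatagagaaaaG"
BOTTOM = "CgtgagggatagtcactatctcttttgtgagggatagtcactatctcttttCAGCT"
OVERHANG = 4
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")
REPEAT = "cactccctatcagtgatagagaaaa"  # unit repeated in the top strand


def strands_pair(top: str, bottom: str, overhang: int = OVERHANG) -> bool:
    """True if the strands are complementary outside the two 5' overhangs."""
    duplex_top = top[overhang:]
    duplex_bottom = bottom[: len(bottom) - overhang]
    return duplex_top.translate(COMPLEMENT) == duplex_bottom


def overhangs(top: str, bottom: str, overhang: int = OVERHANG) -> tuple:
    """Return both single-stranded ends, each read 5'->3'."""
    left = top[:overhang].upper()
    right = bottom[-overhang:][::-1].upper()
    return left, right


print(strands_pair(TOP, BOTTOM))
print(overhangs(TOP, BOTTOM))
print(TOP.count(REPEAT), "copies of the repeat unit")
```

Both overhangs read TCGA, the cohesive end produced by XhoI (C^TCGAG) and by SalI (G^TCGAC), which is consistent with the description of the fragment as XhoI/SalI-ended.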

After ligating the Tet operator DNA to itself, it was digested with XhoI/SalI and
fractionated on a 0.8% agarose gel. DNA with sizes ranging from 100 bp to 200 bp was
inserted at the XhoI site of pCWX210. Using this protocol, two plasmids, pCWX210T2
and pCWX210T10, were obtained, which contained 4 and 6 TetO2 operators, respectively.
The EcoRI-ended Gal1/Gal10 promoter regions of pCWX210T2 and pCWX210T10 were
inserted into the EcoRI site of pCWX211 to generate pCWX211T2 and pCWX211T10,
respectively. In addition, a KpnI site in these plasmids was created by inserting a
self-annealed oligo (5'-GGCCGCGGTACCGC-3') (SEQ ID NO: 6) at the NotI site to create
pCWX212T2 and pCWX212T10, respectively. In these vectors, the TetOp-URA3
reporter was positioned on a KpnI fragment. Deletion of the ClaI site in these plasmids
produced plasmids pCWX213T2 and pCWX213T10. Replacement of the KpnI fragment
of LYS2 from pCWX01 with KpnI-ended TetOp-URA3 constructs from pCWX213T2
and pCWX213T10 created pCWX02T2 and pCWX02T10.
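
The operator counts reported above are consistent with simple arithmetic on the self-ligated fragment, as the following Python sketch suggests. It assumes that each ligated unit contributes about 56 bp and two operators, and that only whole units are recovered; the function names are hypothetical. The second function illustrates why an XhoI/SalI digest would be expected to leave head-to-tail junctions intact: such a junction reads GTCGAG, which matches neither recognition site, whereas head-to-head and tail-to-tail junctions regenerate an XhoI or SalI site.

```python
UNIT_BP = 56             # length of the synthetic XhoI/SalI-ended fragment
OPERATORS_PER_UNIT = 2   # two TetO2 operators per fragment, per the text
XHOI, SALI = "CTCGAG", "GTCGAC"


def operators_by_size(min_bp=100, max_bp=200, max_units=6):
    """Map approximate concatemer size (bp) to operator count within the window."""
    sizes = {}
    for n in range(1, max_units + 1):
        size = n * UNIT_BP
        if min_bp <= size <= max_bp:
            sizes[size] = n * OPERATORS_PER_UNIT
    return sizes


def junction_is_cut(left_base: str, right_base: str) -> bool:
    """True if a ligated TCGA junction regenerates an XhoI or SalI site."""
    return left_base + "TCGA" + right_base in (XHOI, SALI)


print(operators_by_size())
print("head-to-tail cut:", junction_is_cut("G", "G"))
print("head-to-head cut:", junction_is_cut("C", "G"))
print("tail-to-tail cut:", junction_is_cut("G", "C"))
```

Under these assumptions, only dimers and trimers of the fragment fall in the 100 bp to 200 bp window, corresponding to 4 and 6 operators.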

Integration of the TetOp-URA3 Reporter at LYS2. Following digestion of
pCWX02T2 and pCWX02T10 with ClaI/XbaI, these vectors were transformed into
Saccharomyces cerevisiae strain EGY40 containing pCWX200Cdi2 (Gietz et al., Yeast
11:355-60 (1995)), a vector which directed expression of the TetR-Cdi2 gene, a
transcription activator (Finley and Brent, Proc. Natl. Acad. Sci. 91:12980-84 (1994)).
Transformants were selected on medium lacking uracil and leucine. Ten independent
clones were picked for each pCWX02T2 and pCWX02T10 transformation and streaked
onto alpha-aminoadipic acid plates to select for those in which LYS2 was
inactivated (Chattoo et al., Genetics 93:51-65 (1979)). Integration of the TetOp-URA3
gene at the LYS2 locus was confirmed by PCR using primers which hybridized to the
regions flanking the KpnI fragment of LYS2. The resulting strains CWXY1 and CWXY2
(MAT a ura3 his3 trp1 leu2 lys2Δkpn::TetOp-URA3) contained 4 and 6 TetO2 operators
upstream of the URA3 gene, respectively.

Assays for the And and Discrimination Relationships 

To assay interaction relationships, a series of vectors were constructed. c-Raf1(1-313)
was cloned into the EcoRI/XhoI sites of vectors pCWX200 and pJG4-5. Following
amplification of RasA15Δ186, hSos1(601-1019), and Cdc25(907-1589) with the RasA15Δ186
primers 5'-GCCTGAATTCATGACGGAATATAAGCTGG-3' (SEQ ID NO: 7) and
5'-CCCGAACTCGAGTCAGGAGAGCACTGCCTTGCAGC-3' (SEQ ID NO: 8), the
hSos1(601-1019) primers 5'-GCCTGAATTCAAAGCAGGAACTGTT-3' (SEQ ID NO: 9)
and 5'-CCCGAACTCGAGCTATCGTGGTTCTATTTCTAG-3' (SEQ ID NO: 10),
and the Cdc25 catalytic domain (907-1589) primers
5'-GCCTGAATTCATGTCTTCGGTCTCCTCAG-3' (SEQ ID NO: 11) and
5'-CCCGAACTCGAGTTATCGAAATAACCTAGAAGG-3' (SEQ ID NO: 12), the
EcoRI/XhoI-ended PCR products of RasA15Δ186, hSos1(601-1019), and Cdc25(907-1589)
were also cloned into the EcoRI/XhoI sites of vectors pEG202 and pJG4-5. After
CWXY2 containing pCWX24 was transformed with the combinations of two baits and preys
shown in Figure 2, transformants were streaked onto dropout plates lacking leucine,
histidine, lysine, and tryptophan ("LHKW"), having glucose as a carbon source ("Glu"),
and supplemented with 100 µg/ml 6-azauracil, an inhibitor of URA3 (Le Douarin et al.,
Nucl. Acids Res. 23:876-8 (1995)). These plates were incubated at 30°C for 12-48 hours.
Yeast streaks were next replicated onto the four dropout plates, LHKW/Glu+Xgal,
LHKW/Galactose (Gal)+Raffinose (Raff)+Xgal, LHKWU/Glu, and LHKWU/Gal+Raff
as shown above, and the results were scored after incubation at 30°C for 12-72 hours.
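
As an illustrative check on the primers listed above (SEQ ID NOS: 7-12), the following Python sketch confirms that the first primer of each pair carries an EcoRI recognition site (GAATTC) and the second carries an XhoI recognition site (CTCGAG), consistent with the EcoRI/XhoI-ended PCR products described. The labels "forward" and "reverse" and the function name `check_primers` are assumptions made here for clarity.

```python
ECORI = "GAATTC"
XHOI = "CTCGAG"

# Primer sequences as given in the text; direction labels are assumed.
PRIMERS = {
    "SEQ ID NO: 7": ("forward", "GCCTGAATTCATGACGGAATATAAGCTGG"),
    "SEQ ID NO: 8": ("reverse", "CCCGAACTCGAGTCAGGAGAGCACTGCCTTGCAGC"),
    "SEQ ID NO: 9": ("forward", "GCCTGAATTCAAAGCAGGAACTGTT"),
    "SEQ ID NO: 10": ("reverse", "CCCGAACTCGAGCTATCGTGGTTCTATTTCTAG"),
    "SEQ ID NO: 11": ("forward", "GCCTGAATTCATGTCTTCGGTCTCCTCAG"),
    "SEQ ID NO: 12": ("reverse", "CCCGAACTCGAGTTATCGAAATAACCTAGAAGG"),
}


def check_primers(primers: dict) -> dict:
    """Report whether each primer contains the site expected for its direction."""
    results = {}
    for name, (direction, seq) in primers.items():
        site = ECORI if direction == "forward" else XHOI
        results[name] = site in seq
    return results


for name, ok in check_primers(PRIMERS).items():
    print(name, "site present" if ok else "site missing")
```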



Assays for the And Operation on Protein Inputs 

To carry out additional assays, an EcoRI/XhoI-ended PCR product of human
GAP1, amplified with primers 5'-GCCTGAATTCATGAAGGGGTGGTATCACGGA-3'
(SEQ ID NO: 13) and 5'-CCCGAACTCGAGCTACTTGACATCATTGGTTTTTG-3'
(SEQ ID NO: 14), was cloned into pCWX200. EcoRI/XhoI-ended Cdc25(907-1589) (as
in Figure 2) was also cloned into pCWX200. CWXY2 containing pEG202RasΔ186 and
pJGRaf(1-313) was then further transformed with either pCWX200,
pCWX200CDC25(907-1589), or pCWX200GAP. Streaks of transformants pooled from
>50 independent transformants were replicated onto LHKW/Glu+Xgal and
LHKW/Gal+Raff+Xgal dropout plates, and these plates were incubated at 30°C for 2-3
days. Cells on the former plate showed no blue color, and, on the latter plate, showed blue
color of varied intensity. β-galactosidase activity was measured by liquid assays in
triplicates of cell pools (Estojak et al., Mol. Cell. Biol. 15:5820-9 (1995)), each of which
contained >50 independent transformants. After inoculating LHKW/Gal+Raff medium at
an OD600 of 0.2 with washed cells grown to late log phase or saturation in LHKW/Glu
liquid media, cultures were incubated at 30°C for an additional 5 hours, and
β-galactosidase assays were performed as described in Estojak et al. (Mol. Cell. Biol.
15:5820-29 (1995)).

Other Embodiments
In an alternative to the above methods, the systems involving two 
reporter genes described herein may be used to select or screen for proteins that 
bind to two different protein binding sites. To carry out these screens, two 
different DNA fragments are inserted upstream of the two reporter genes (for 
example, a URA3 and a lacZ gene), and expression of each of the genes is assayed 
in a cell that also contains a gene expressing a candidate binding protein. This 
candidate binding protein may be expressed in its native state or as a fusion protein 
joined to an activation domain. 

Any of the methods described herein may be used to screen, without 
limitation, proteins, peptides, or aptamers. 

Other embodiments are within the claims.